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Abstract — We consider a two-user Gaussian multiple access 
channel with two independent additive white Gaussian interfer- 
ences. Each interference is known to exactly one transmitter 
non-causally. Transmitters are allowed to cooperate through 
finite-capacity links. The capacity region is characterized to 
within 3 and 1.5 bits for the stronger user and the weaker 
user respectively, regardless of channel parameters. As a by- 
product, we characterize the capacity region of the case without 
cooperation to within 1 and 0.5 bits for the stronger user 
and the weaker user respectively. These results are based on a 
layered modulo-lattice transmission architecture which realizes 
distributed interference cancellation. 

I. Introduction 

In modern wireless communication systems, interference 
has become the major barrier for efficient utilization of avail- 
able spectrum. In many scenarios, interferences are originated 
from sources close to transmitters and hence can be inferred 
by intelligent transmitters, while receivers cannot due to 
physical limitations. With the knowledge of interference as 
side information, transmitters are able to encode their infor- 
mation against interferences and mitigate them, even though 
the receiver cannot distinguish interferences from the desired 
signal. The simplest information theoretic model for studying 
such interference mitigation is the single-user point-to-point 
dirty-paper channel (TJ, which is a special case of state- 
dependent memoryless channels with the state known non- 
causaully to the transmitter [2 |. It is shown that the effect of 
interference can be completely removed in the additive white 
Gaussian noise (AWGN) channel when the interference is also 
additive white Gaussian [ 1 ]. As for multi-user scenarios, it has 
been found that when perfect state information is available 
non-causally at all transmitters, the capacity region of the 
AWGN multiple access channel (MAC) is not affected by the 
additive white Gaussian interference J4). When the sate 
information is known partially to different transmitters in the 
MAC, however, the capacity loss caused by the interference 
is unbounded as the signal-to-noise ratios increase 0. 
Since each transmitter only has partial knowledge about the 
interference, interference cancellation has to be realized in a 
distributed manner. 

In this paper, we consider an A'-user Gaussian MAC with 
K independent additive white Gaussian interferences. Each 
interference is known to exactly one transmitter non-causally. 



Furthermore, we allow transmitters to cooperate through finite- 
capacity links, so that transmitters can cooperatively transmit 
their messages and/or mitigate the known interferences. This is 
exactly the same model studied in [5 1 except for the transmitter 
cooperation. For simplicity, we mainly focus on the two- 
user case, termed as doubly-dirty MAC in 0. The model is 
depicted in Fig. [T] where 



V = Xi 
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Si + s 2 



and z ~ Af (0, N a ) is the AWGN noise. Interference Si ~ 
Af (0, Qi), i — 1,2, independent of everything else, is known 
non-causally to transmitter i only. Power constraint at trans- 
mitter i is Pi, i = 1,2. Define channel parameters SNR,; := 
Pi/N , INR,; := Qi/N , for i = 1, 2. Transmitter cooperation 
is induced by two orthogonal noise-free links with capacity 
Ci 2 and Cfi, which carry signals ti 2 and £21 respectively. 
User i's rate is denoted by Ri, i — 1,2. Throughout this paper, 
without loss of generality we assume that user 1 has a stronger 
transmission power, that is, Pi > P 2 . 
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Fig. 1. Channel Model 

State-dependent networks with partial state knowledge 
available at different nodes have been studied in various 
scenarios. Kotagiri et al.fll study the state-dependent MAC 
with state non-causally known to only one transmitter, and for 
the Gaussian case they characterize the capacity asymptotically 
at infinite interference (Qi = 00, Q2 = 0, Cf 2 = Cfi = 0) as 
the informed transmitter's power grows to infinity. Somekh- 
Bamch et a/.0 study the problem with the same set-up as 
Q while the informed transmitter knows the other's message 
(Q2 = 0, Ci 2 = 0, C21 = R2), and they characterize the 
capacity region completely. Zaidi et al. [8| study another case 
of degraded message set (Q2 = 0, Cf x = 0, C^ 2 = Ri). The 
achievability part of Q, 0, and are based on random 



binning. Philosof et al. Q, on the other hand, characterize the 
capacity region of the doubly-dirty MAC to within a constant 
gap at infinite interferences (i.e., Qi = Q2 — 00, Cf 2 — Cf x = 
0), by lattice strategies. They also show that strategies based on 
Gaussian random binning is unboundedly worse than lattice- 
based strategies. On the other hand, transmitter cooperation 
has also been widely investigated in various scenarios, and 
a non-exhaustive list includes MAC [ 1 1 ATI , interference 
channels [12| [13], MAC with state known to all transmitters 
iTTTl . and MAC with partial state known to transmitters and 
full state knowledge at the receiver |[T4ll . 

Main Contribution 

We characterize the capacity region of the doubly-dirty 
MAC with transmitter cooperation to within 3 and 1.5 bits for 
Ri and i? 2 respectively. As a by-product, we characterize the 
capacity region of the doubly-dirty MAC without cooperation 
to within 1 and 0.5 bits for i?i and i? 2 respectively, and 
hence extend the constant-gap-to-optimality result in J3] to 
arbitrary interference powers. The constant gap is independent 
of channel parameters and provides a strong guarantee on 
the performance. It turns out that the approximate capacity 
region does not depend on Cf 2 - The cooperation link from the 
stronger transmitter (Txl) to the weaker one (Tx2) is not re- 
quired to achieve the constant-gap-to-optimality performance, 
and it only yields a power gain which is upper bounded by a 
constant that does not increase with C^ 2 . In particular, consider 
the system with C 21 = and Cf 2 = 00. As a consequence of 
the main results, the capacity region of this system is at most 
2 bits per user larger than the capacity region of the system 
without cooperation. 

For the achievability part, we propose a layered superposi- 
tion strategy consisting of three layers - (1) the lattice strategy 
layer £, (2) the cooperation layer £, and (3) the Gaussian 
random binning/relaying layer £H. The hierarchy of the layers 
is £H — ^ C — ^ from the top to the bottom. Each layer 
treats the signals sent at higher layers as interference, each of 
which is known non-causally to exactly one transmitter but 
not both. In layer £, we use a similar lattice strategy as Q 
to realize distributed interference cancellation. Tx2 uses up its 
own power in this layer. In layer £, the weaker transmitter 
Tx2 compresses the precoded information (precoded against 
intereference s 2 ) at a proper distortion, and uses part of the 
cooperation capacity to send the compression index to Txl. 
Then Txl precodes it along with user l's information against 
the aggregate interference at this layer. In layer 91, Tx2 uses 
the rest of the cooperation capacity to send additional data to 
Txl. Txl uses the rest of its power to further transmit its own 
information or relay user 2's information, precoded against si 
using either Gaussian random binning [1| or lattice strategies 
[9[M For the outer bound, we use a similar argument as |15|. 



Notations 

Throughout the paper, the block coding length is denoted by 
N, a sequence of random variables x[l], . . . , x[N] is denoted 
by x N and boldface x interchangeably. Logarithms are of 
base 2 if not specified. We use a short-hand notation C (•) to 
denote | log (1 + ■), (-) + to denote max{0, •}, and log + (•) to 
denote (log (-)) + - I {^4} denotes the indicator function, which 
is evaluated to 1 if event A is true and otherwise. 

II. Without Cooperation 

To better convey the idea of achievability and outer bounds, 
we first deal with the case without cooperation. Outer bounds 
are derived first. Then we describe the scheme that achieves 
the capacity to within a constant gap for arbitrary interference 
variances and transmit powers. 

A. Main Result 

The main result in this section is summarized in the follow- 
ing lemmas and theorem. 

Lemma 2.1 (Outer Bounds Without Cooperation): If non- 
negative (i?i,i? 2 ) is achievable, it satisfies the following: 



R1+R2 <C(SNRi +SNR 2 ) 

1 + SNRi +SNR 2 



Ri+R 2 <C 



INR, 



C(SNR 2 ) 



i? 2 <C(SNR 2 ). 



(1) 

(2) 
(3) 



Proof: See Appendix |A-A 



Lemma 2.2 (Achievable Rate Without Cooperation): If 
nonnegative (Ri,R 2 ) satisfies the following, it is achievable: 



Ri+R 2 < 1 - log H 



R2 < 2 log H 



* +SNR 2 



C 



SNR; 



SNRi - SNR 2 
+ 2SNR 2 + INR 2 
(4) 



(5) 



Proof: Achievability will be detailed in this section. ■ 
Theorem 2.3 (Constant Gap to Optimality): The above in- 
ner and outer bounds are within (1,0.5) bits for (i?i,i? 2 ). 
Proof: We combine Lemma |2.1| and |2.2| to obtain the 



theorem. See Appendix B-A for detailed gap analysis. ■ 

B. Achievability 

We use the rest of this section to establish the achievability 
result in Lemma [2~2] We refer to Q and the references therein 
for preliminary on lattices. 

The scheme consists of two layers: layer D\ and layer £. 
As described in Section [TJ *H stands for Gaussian random 
binning and £ stands for lattice-based strategy. We decompose 
message w\ into (iui«r, w\s) and rename u> 2 as w 2 £. We split 
the encoder at Txl, £\, into (£vr,£\z), split the decoder at 
Rx, T>, into (2?<k, T>^), and rename the encoder at Tx2, £ 2 , 
as £ 2 £. Encoders £1^, £12, and £22 output signals x^, Xi£, 
and x 2 £ respectively. Txl sends out the superposition of x^ 
and xi£. Hence the receive signal can be written as 



'The name "Gaussian random binning layer SK" is to stress that Gaussian 
random binning and lattice strategies are equally good. 



Xi<k + Xi< 



X 2 £ + Si + S 2 + Z. 



Encoding 

1) Layer £: Encoders £is and E2Z use a lattice A £ with second 
moment 0o = P 2 and basic Voronoi region Vs to modulate 
Wig and W2£- Generate random independent codebooks of 
sizes 2 NRl ~ and 2 NR2 ~ according to Unif (Vs.) for Wis 
and u>2£ respectively. Let the codewords be vi£ and V2 £ 
respectively. 

Signals Xi£ and X20 are generated using the following 
modulo-lattice operation: 

x 4 £ = [vj£ - aos i£ - di S ] mod A £> * = ^ 2 : ( 6 ) 

where di£ and d2£, randomly and independently generated 
according to Unif (Vs), are dithers known to the receiver [9|. 
as is the MMSE coefficient 2 q^ No = i+|snr 2 ■ 

Si£ and S20 denote the effective interferences known to 
Txl and Tx2 respectively in this layer: = Si + x^ and 
S2£ = §2- Note that si£ can be produced by the higher layer 
encoder £\<y\. 

2) Layer £H: Layer is only used at Txl for user 1. Encoder 
£-\$k uses power 6^5 = Pi — 0£ to encode message u>i«h, 
using dirty-paper coding against interference 

The encoder architecture at Txl is depicted in Fig. [2] 
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Fig. 2. Encoder Architecture at Txl (without Cooperation) 
Decoding 

1) Layer £: Decoder T>s takes the input y, rewritten as 

y = Xl£ + X 2 £ + Sl£ + S 2 £ 



where {s^£, i = 1, 2} are defined as above and Z£ = z. 

We follow the same decoding procedure and the same line 
of analysis as (5) and (9J- The main idea is to transform 
the original channel into a modulo-lattice MAC, by first 
multiplying y by as, adding the dithers back, and taking 
modulo A£. The resulting output 

y £ = [y - (1 - a s ) y + di£ + d 2£ ] mod A £ 



(a) 



[vis + v 2 £ + Zcff^] mod As, 



where z c g t s = cts^s — (1 — c*.e)(xi£ + x 2£)- (a) is due to 
the distributive property of modulo-lattice operation. Due to 
dithering, z e ff,£ and Vi£,V2£ are independent [5|. Hence the 
achievable rates for the modulo-lattice MAC are nonnegative 
(i?i£,i? 2 £) satisfying 

Rxs + Rvs < ~ log + \ —a tt^ 

2 Va| Ar o + (l-a£) 2 26 1 



This justifies the choice of as being the MMSE coefficient 
, which minimizes the effective noise variance. With 



20r 



20 £ +iV t 

this choice the achievable rates are 



R 



2S 



SNR; 



(7) 



2) Layer 9^: Decoder 2?<h takes the input y, rewritten as 

y = XlJR + BlJR + ZCR, 

where Si^ = Si and z^ = Xi£ + X2£ + S2 + z. 

Note that due to independent dithering, {xi£,Xi£, Si, 82} 
are mutually independent. Hence the aggregate xi £ + X2£+S2 
is indeed independent of the encoding auxiliary codeword in 
dirty-paper coding, and its variance is 20£ + Q 2 = 2P 2 + Q 2 . 
We conclude that zi<k is independent of xi<k and Si^, and its 
variance is N Q + 2P 2 + Q2- The following claim establishes 
a lower bound on the achievable rate when we apply Costa's 
scheme to a dirty-paper channel where the additive noise is 
non-Gaussian: 

Claim 2.4: Consider a dirty -paper channel Y = X + S + Z 
where X has power constraint P, S is zero-mean Gaussian 
with variance Q, Var [Z] = Nz, and {X, S, Z} are mutually 
independent. S is known non-causally to the transmitter. Then 
the achievable rate is at least C (P/Nz). 

Proof: We use an argument similar to the worst-case noise 
property of Gaussian noise in additive noise channel [16|. See 
Appendix [C] for detail. ■ 

Using the above claim, we have the achievable dirty-paper 
coding rate [1| which is evaluated assuming zir^ is Gaussian: 
Rim > satisfying 

e K \ „ / SNR1-SNR2 



Rim < C 



= c 



N a +2P 2 + Q 2 J ' Vl + 2SNR 2 + INR 2 ^ 
Plug in Ri = Ris+Rim and R 2 = R 2 s and apply Fourier- 
Motzkin elimination to the above achievable rate regions, we 



establish the achievability result in Lemma 2.2 

We conclude this section by two remarks. 

Remark 2.5: The resultant achievable region does not have 
a constraint on the individual rate Ry. This is due to the 
geometric structure of the achievable region (a triangle rather 
than a pentagon) in the lattice layer £. 

Remark 2.6: The novelty of the proposed scheme compared 
with that in is the additional layer £H of user l's code 
and the idea of using lattice precoding to remove layer 
in decoder T>s- Therefore, decoders 2?^ and T>s can work 
in parallel. In a general setting where the number of users 
K > 2, the same layered architecture with K layers suffices 
to achieve the capacity region to within a constant gap, where 
the constant only depends on the number of users K. This 
result will be detailed in a follow-up paper. 

III. With Cooperation 

With cooperation, we shall first derive the outer bounds by 
a slight modification of the previous arguments, taking the 
transmitter cooperation into account. Then we add one more 
layer into the previous layered strategy, which is induced by 
the cooperation from Tx2 to Txl, and show that it achieves 
the outer bound to within a constant gap. 



A. Main Result 

The main result in this section is summarized as follows. 
Lemma 3.1 (Outer Bounds With Cooperation): If nonneg- 
ative (Ri,R 2 ) is achievable, it satisfies the following: 



R 1 +R 2 <C (SNRi + SNR 2 + 2--/SNRiSNR- 



f r / l+SNR 1 +SNR 2 +2VSNR 1 SNRo 

Ri + R-><< V INR 2 
[ +C(SNR 2 ) + C B 1 

R 2 <C(SNR 2 ) + C 2 3 1 . 




Proof: See Appendix |A-B 



Lemma 3.2 (Achievable Rate With Cooperation): If 
nonnegative (Ri, R2) satisfies the following, it is achievable. 



R2<\ l0g H 



* +SNR 2 

Pi - e e 



c 



N + 2P 2 



n + e £ + 2P 2 



R2<\ l0g H 



* +SNR 2 



Q-2 

C 



A^o + 2P 2 



+ (C 2 B ! - rai) • 
Here we choose 



1 
2 

(11) 

r 

(12) 



e,,- = min <j (JV + 2P 2 ) ( 2 2C ^ - 2 ) ,Q 2 ,Pi~P 2 



and r 2 i=C(l + ] ^S^jl{C 2 B 1 >|}. 

Proof: Achievability will be detailed in this section. ■ 
Theorem 3.3 (Constant Gap to Optimality): The above in- 
ner and outer bounds are within (3, 1.5) bits for (R\, R 2 ). 

Proof: We combine Lemma |3.1| and |3.2| to obtain the 
theorem. See Appendix |B-B| for detailed gap analysis. ■ 

B. Achievability 

We shall only make use of the link from Tx2 to Txl, 
as suggested by the outer bounds. In addition to the above 
mentioned layers and £, due to the cooperation we in- 
troduce a third middle layer £, which denotes cooperation. 
Decompose messages w\ into (wisr, w m, wis) and w 2 into 
{w 2 <y\, w 2 <r, w 2 s). We keep the encoder and decoder archi- 
tecture as in the case without cooperation, except that now 
we split the encoder at Txl, £\, into three sub-encoders 
(£ ik, £ ie, £ic), and split the decoder at Rx, T>, into three 
sub-decoders T>£, T>^). Encoder £\$_ outputs signal Xij. 
Txl sends out the superposition of x^, Xie, and Xi£. Hence 
the receive signal can be written as 



y = 

Encoding 



Xl£ +Xu 



X 2£ + Si 



s 2 



1) Layer £: We use the same scheme as in the case without 
cooperation. The only difference is the effective interference 
known to Txl in this layer becomes si£ = si + xi<h + xxc. 

2) Layer £: In this layer, we use a lattice Ag with second 
moment 9c and basic Voronoi region Ve to modulate w±<t 



and w 2 £. Generate random independent codebooks of sizes 
2 JVii ie and 2 JVfl 2l £ accor ding to Unif (V c ) for w ie and w 2e 
respectively. Let the codewords be vie and v 2e -. Txl and Tx2 
would transmit xi<r and x 2e - respectively, using the following 
modulo-lattice operation, if they had enough power: 



Xic 



ctgSiir — die*] mod A^, £ = 1,2, 



where d^g's are dithers, Sjg-'s are effective interferences known 
to transmitters, and is the MMSE coefficient. However, 
since Tx2 has no power left (recall that in layer £ Tx2 has 
already used up its power), user 2's precoded signal has to be 
transmitted by Txl via cooperation. Therefore, dither d 2 £ is 
no longer needed because in this layer the received signal is 
solely contributed by Txl. 

The effective interference s 2 £ = s 2 . Tx2 first compresses 



x 2 c := [v 2 c - a £ s 2€ ] mod A £ 



(13) 



using a Gaussian vector quantizer: x 2 £ = x 2( r + z, and 
z ~ Af(0, AIjv) is independent of everything else. A denotes 
the quantization distortion. Note that the rate for Txl to 
recover x 2( r reliably is upper bounded by the rate distortion 
function assuming x 2 £ is Gaussian, since Gaussian distribution 
is the differential entropy maximizing distribution under power 
constraint. Let the rate for sending the compression index be 
r 2i, r 2i < Cfr Hence, we have the following criterion: 



r 2 i > ~ log ( J 



6e 
A 



(14) 



We shall set r 21 such that the above holds with equality if 

Cfi — \ 1°S (l + a )■ ^ not ' we s i m Ply drop this layer by 
setting r 2 \ = 0q; = 0. The value of A will be described later 
in the decoding part. 

Tx2 then sends the quantization point x 2 £ to Txl. Encoder 
£m outputs 

x ic = [v ic + 5i 2l r - - die] mod A c , 

where the effective interference sic = Si +xi^-. The value of 
a£ will be described later in the decoding part. 

3) Layer 5H: Layer 9\ is now shared between both users. Tx2 
uses the rest of the cooperation capacity (C^ — r 2 i) to send 
message w 2 yt to Txl. Txl uses the rest of the power, that is, 
Pi — 6c — P 2 , to encode messages (wi«k, w 2 <r), using dirty- 
paper coding or lattice strategies against interference s x . 
The encoder architecture at Txl is depicted in Fig. [3] 

Decoding 

1) Layer £: Decoder 2?£ remains the same as in the case 
without cooperation, and achievable rates are described in ((7J. 

2) Layer £: Decoder T><i takes the input y, rewritten as 

y = xif + sic + s 2C + z £ , 

and employs the same decoding procedure as decoder T>£. 
Here {s^, i = 1,2} are defined as above and Z£ = xi£ + 

X 2 £ + Z. 
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Fig. 3. Encoder Architecture at Txl (with Cooperation). Blue denotes the 
cooperation information 



The equivalent modulo-lattice channel output is 

y<£ = [y - (1 - etc) y + die] mod A £ 
= [vie + v 2C + z effi£ ] mod A £ , 

where ZeB,£ = z + oitrzg — (1 — ac)xi£. Hence the achievable 
rates are nonnegative (Rm, R2e) satisfying 



Rl€ + R2€ < ^ log 



e. 



i(N + 2P 2 ) + (1 - a e ) 2 e £ 



We shall choose a* = 7^ — , , „ D s to minimize the effective 
noise variance, which is now A+ Qr+^N + +2P 2 ) • ^ or simplicity 



we choose A 



^tt^t-t^s , and hence the achievable rate 
region consists of nonnegative (Ric,R 2 c) satisfying 



Pie 



i?2C < I C 



^ D + 2P 2 , 

As for the values of r%\ and Bg, note that ( fT4| ) 
becomes r 2 i > C ^1 + w ^_ 2 p, ) wriere the right-hand 
side is always lower bounded by 1/2. Besides, in layer 
9\ the codeword in layer €. is treated as noise, where 
the interference s 2 is also treated as noise. Hence, if 
> 1/2, we set r 2 i = C (l + N ^_%p 2 ) , and 6 e = 

min |(AT + 2P 2 ) ^2 2C 2i - 2V Q 2 , Pi - P 2 \. Otherwise, we 
set r 2 i = 0£ = 0. 

3) Layer Decoder 1?^ uses the same procedure as in the 
case without cooperation to decode u> 2 <r)- The only 

difference is the effective noise term z<h becomes xi£ + xi£ + 

X 2 £ + S 2 + Z. 

Similar to the previous argument in the case without cooper- 
ation, we obtain the achievable rate in this layer: nonnegative 
Pim,P2!K satisfying 



R 



15R 



R2m ^ C 



Pi - gg - P 2 
N + 9 C + 2P 2 + Q 2 



P2K < C 21 — r 2i . 

Plug in R x = Pi£+Pic+Pik and P 2 = P 2 £+P 2 £+-R 2 m. 
and apply Fourier-Motzkin elimination to the above achievable 



We conclude the paper by the following remark discussing 
how to sharpen the gap. 

Remark 3.4: In the proposed scheme there are several 
points for future improvement. First, the cooperation link from 
Txl to Tx2 is not utilized, though it only provides a power 
gain. Second, Tx2 uses a suboptimal Gaussian VQ to compress 
X2£ defined in < |13) , which is because it is technically simpler 
to handle. The quantization distortion A is also a heuristic 
choice, which can be further optimized. 
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rate regions, we establish Lemma 3.2 



Appendix A 
Converse Proofs 

A. Proof of Lemma \2.1\ 

1) R 1 + R 2 bound: 

If i?i and R 2 are achievable, by Fano's inequality and data 
processing inequality, we have 

N (Ri + R 2 -e N )<I (w u w 2 ; y N ) 



(a) 



I{wi,w 2 ,s i { ,s$;y )-I(s 1 ,s 2 ;y \wi,w 2 ) 



(b) 



h{y N )-h(z N )-h(s^s^\ Wl ,w 2 ) 
+ h(s?,s»\ Wl ,w 2 ,y N ) 



( = } - N]o o 



{2TTe) i N Q 1 Q 2 /2 



+ h(y N )+h{ S »,s»\w u w 2 ,y N ) 



(15) 
(16) 
(17) 
(18) 

(19) 
(20) 



(*) 



where e^v — > as N — > oo. (a) is due to chain rule, (b) is due 
to the fact that (x± ,x 2 ) is a function of (u>i, w 2 , sf , sf). 
(c) is due to the fact that w 2 , sf\ s 2 } are mutually 
independent. 

Note that the term (*) can be upper bounded as follows 



h(y")+h(s?, S »\ y ^ 



h(y 



N\„N „N 



, S f)+ft(sf,s^ 



<ylog 



(2^e) 3 (^ + P 1 +P 2 )Q 1 Q 2 



(21) 



This gives the outer bound ([T}. 

On the other hand, (*) can also be upper bounded as follows 

(*) = ft (y N ) + h (sf K ,w 2 ,y N )+h (sf K ,W2,V N , s?) 

< ft (y N ) + ft (sf |^) + ft (s»\wi,w 2 ,y N , sf ) 

® h (y N \s?) + ft (sf ) + h (of + z N \ Wl ,w 2 ,y N , sf ) 
<h(y N \sf)+h(sf)+h(xf + z N ) 



N i 

< — log 

— 2 



(2^e) 3 (A + Q 2 + Pi + P 2 ) Qi (A + P 2 ) 



(22) 



(a) is due to conditioning reduces entropy, (b) is due to chain 



rule and the fact that y" = + x 2 + sf + s 2 + z N and 
the fact that xf is a function of (wi,sf). Hence, this leads 
to the outer bound pi. 

2) R 2 bound: Providing the state information (sfjsf) to 
the decoder, we obtain the clean MAC without transmitter 
cooperation, and the bound (|3]l is trivial. 

B. Proof of Lemma \3.1\ 

1) Rt + R 2 bound 

The first part of the proof follows the same line as the case 
without cooperation, from ( fT5] l to ( |20] >, and the upper bound 
on (*) in ( |2T| is replaced by 



N 



log 



(27re) 3 (n o + P 1 +P 2 + 2^F\P 2 ) Q 1 Q 2 



taking the correlation between xf and xf into account. This 
gives the outer bound (j8). 



On the other hand, the upper bound on (*) in ( |22| i is slightly 
modified as follows: 



(*) 



ft (y N ) + ft (s?\w u w 2 ,y N ) + ft (s?\ Wl ,w 2 ,y N , sf) 



(a) 



< ft (y N ) + ft (s?\y N ) + h (s»\ Wl ,w 2 , y N , sf , 4) 



1 [ s 2 ]t 2 i\wi,w 2 ,y 



N „N\ 



00 



< ft (y N \s?) + ft (sf ) + ft (s»\ Wu w 2 ,y N , sf , 4) 
H (4) 

ft (y N \s?) + ft (sf ) + ft (xf + z^K, w 2 , y N , sf , 4) 



(c) 



< 



ft (^|sf ) + ft (sf ) + ft (xf + z N ) + H (4) 

< — - log [(27Tef(N +Q 2 +P 1 +P 2 + 2 VPn^)Ql(N +P 2 )} + NC 21 . 

(a) is due to conditioning reduces entropy, (b) is due to chain 
rule and conditioning reduces entropy, (c) is due to the fact 



that y 



N 



"f + xf + sf + sf + z N and the fact that xf 



„N 



is a function of (wi,sf , t^i). Hence, this leads to the outer 
bound (|9]l. 



2) i? 2 bound 

Providing the state information (s 



1 i s 2j 



to the decoder, we 



obtain the clean MAC with transmitter cooperation. Then the 
cut-set bound gives ( |T0] >. 

Appendix B 
Gap Analysis 



A. Proof of Theorem 2.3 



Compare the R 2 bounds: 
0-(|5) = ^log(l + SNR 2 ) 
< ilog(l + SNR 2 ) 



log" 
log 



+ SNR; 

1 + SNR 2 



Compare the Ri+R 2 bounds: if INR 2 < 1+2SNR 2 , Q— © 
is upper bounded by 



log(l + SNR!+SNR 2 ) 



1 /1 + 2SNR 2 
- log 

2 5 V 2 



1 / l + SNRi + SNR 2 

2 ° g I 2 + 4SNR 2 



If INR 2 > 1 + 2SNR 2 , @ — (gj is upper bounded by 
l + INR 2 + SNRi +SNR 2 S 



log 



— log 



INR 2 

1 + SNRi +SNR 2 + INR 2 



2INR 2 

Hence the proof is complete. 



= 1. 



B. Proof of Theorem 3.3 
Case C®i < - 



2 • 



The inner bound ( fTT) becomes the same as the sum rate 
inner bound Q in the case without cooperation. Therefore it 
suffices to compare the outer bounds ([TJ with ^ and |2]) with 
(|9) respectively: 

© - CD < C (2(SNRi + SNR 2 )) - C (SNR X + SNR 2 ) < - 



Cf! < 1. 



Using the same argument in Section B-A ^ — (JTTJ < 1.5 
bits, and (|9) - |[TT) < 2 bits. 

The inner bound (L2) becomes ± log 4 " ( 1+2 f NR2 ) +Cf x , and 
hence (QO) - (Q2) < 0.5 bits. 

Case C 21 > -' 



2 • 



We shall distinguish into three cases. 

1) e € = (N + 2P 2 ) (2 2C 2i - 2^: In this case r 21 
The inner bound ( |T2] > becomes 



^21 ■ 



1 



log 4 



1 + 2SNR 2 



^logf2^-l)4V 



1 



> ilog(l + 2SNR 2 )-l + ilog (2 2C " 

> i log (1 + SNR 2 ) - 1 + Cfj. - 1/2 = ([TO) - 3/2. 



Hence the gap is upper bounded by 1.5 bits. 

Since Q 2 > 9c, the inner bound ( fTT| is lower bounded by 

Jlog(l + 2SNR 2 )-3/2 + C 2 B 1 



1 



log 



1 + INR 2 + SNR 1 +SNR 2 
1 + 2SNR 2 + 2INR 2 



(**) 

If 2INR 2 > 1+2SNR 2 , the term (**) is lower bounded by 

1 lQg A+INR 2 +SNR 1+ SNR 2 Y ^ hence ^ gap tQ ^ 

bound Q is upper bounded by 3/2 + ^ log 8 = 3 bits. 
If 2INR 2 < 1+2SNR 2 , the term (**) is lower bounded by 

\ l0 § ( 1 2( S 1+ F 2 1 SN S r!)' 2 ) ' aIld heI1Ce the § a P t0 0Uter bOUIld 

ffify is upper bounded by 3/2 + | log 4 = 2.5 bits. 



2) ®£ = Q 2 : In this case r%i = C ( 1 
bound ( p~2^ > becomes 



inr 2 

1+2SNR2 



log" 1 



*-21 



1 + 2SNR 2 
2 

cd 



C 



INR, 



1 + 2SNR 2 



The inner 



INR 2 



1 + 2SNR 2 



> - log (1 + SNR 2 ) + - 3/2 = (TO) - 3/2. 

Hence the gap is upper bounded by 1.5 bits. Analysis of the 
gap from the inner bound ( fTT| to outer bounds ([8]) |9]) follows 
the same argument as that in 1). 



3) 0£ = Pi — P 2 : The inner bound ( fT2| ) becomes inactive 
since it is greater than the inner bound ( fTT) , which is lower 
bounded by 



1. /1 + 2SNR 2 

- log 

2 6 V 2 



1 /l_fSNR 1 +_SNR 2 

2 ° g I 1 + 2SNR 2 



1 



log(l + SNRi+SNR 2 )-l. 



It is within 1.5 bits to the outer bound dSJ. 

Combining the above analysis, we complete the proof of 
Theorem 13.31 



Appendix C 
Proof of Claim [2~41 

Use the random binning scheme in [2], the rate I (U; Y) — 
I (U; S) is achievable. We choose U = X+aS, X ~ Af(0, P) 
and independent of S, and a — p ^ N , as in [1|. Therefore, 
X + S = E [X + S\ U] + Z' = rU + Z f , where Z' is Gaussian 
and independent of {U, Z'}, Rewrite Y = rU + Z' + Z. 

Note that / (U; S) is a fixed number that does not depend on 
the distribution of Z. We focus on lower bounding / (U; Y). 
Use the argument that Gaussian noise is the worst case noise 
in an additive noise channel [16] and note the Y is the channel 
output with input U and additive noise Z' + Z, we conclude 
the I (U; Y) is minimized when Z is Gaussian. Combining the 
classical dirty-paper coding result UJ we complete the proof. 



